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Clinical isolates of staphylococci (Staphylococcus aureus and S. epidermidis) which cause serious infec- 
tions due to their intrinsic resistance to beta-lactamase-stable beta-lactam antibiotics (e.g., methicillin) carry 
the mecA gene. Song et al., FEBS Lett. 221 :1 67-171 (1 987). This gene encodes a putative cell wall biosynthetic 
enzyme referred to as penicillin binding protein 2a (PBP2a). PBP2a, which binds beta-lactams only at concen- 
5 trations well above therapeutic efficacy, apparently can functionally substitute for all the staphylococcal PBPs 
and permit growth when the host organism is threatened by beta-lactams. Hartman and Tomasz, J. Bacterid. 
158:513-516 (1984). Wu et al., Antimicrob. Agents Chemother. 36:533-539 (1992) and Ryffel et al., Gene 
94:137-138 (1990). 

The mecA gene is not a normal part of the staphylococcal genome. The organism which donated mecA 

10 to the staphylococci remains unidentified. Despite the uniform presence of mecA in methicillin-resistant clinical 
isolates, these isolates vary considerably in their degree of resistance to methicillin. This variation in pheno- 
typic expression within a population has been referred to as heterogenous expression. Matthews and Stewart, 
FEMS Microbiol. Lett. 22:161-166 (1984). Typically, most cells exhibit low-level resistance to methicillin and 
only a minority of the population express high-level resistance, perhaps only one in 10 8 cells. Tomasz et al., 

15 Antimicrob. Agents Chemother. 35:124-129 (1991). Although expression of methicillin resistance is dependent 
upon the presence of FBP2a, it appears to be somewhat independent of the amount of PBP2a, suggesting 
important roles for other factors. Chambers and Hackbarth, Antimicrob. Agents Chemother. 31:1982-1988 
(1987) and Murakami and Tomasz, J. BacterioL 171:874-879 (1989). 

Tn55T insertional mutagenesis of methicillin-resistant S. aureus revealed numerous sites which influence 

20 the level of methicillin resistance but are not linked to mecA and do not perturb the expression of PBP2a. Berg- 
er-Bachi et al., Antimicrob. Agents Chemother. 36:1367-1373 (1992); Kornblum et al., Eur. J. Clin. Microbiol. 
5:714-718 (1986); Berger-Bachi et al., Mol. Gen. Genet. 219:263-269 (1989) and Maidhof et al., J. BacterioL 
173:3507-3513 (1991). Those factors described thus far generally depress the MIC of beta-lactam resistant 
strains. Some of the genetic loci which demonstrate such an effect on methicillin resistance were designated 

25 factors essential for methicillin resistance (fern). Berger-Bachi et al., Mol. Gen. Genet 219:263-269 (1989). In 
contrast to mecA, the genes which encode influential factors are probably present in both resistant and sus- 
ceptible strains of S. aureus and S. epidermidis. Information obtained from gene disruption studies of femA 
and femB in S. aureus indicated that in addition to enhanced sensitivity to methicillin, homogeneously methi- 
cillin-resistant S. aureus strains carrying such gene disruptions have a reduced glycine content in the pepti- 

30 doglycan component of their cell walls (Maidhof et al., J. BacterioL 1 73:3507-351 3 (1 991 )) and exhibit reduced 
rates of cell wall turnover and autolysis (de Jongeet al., J. BacterioL 173:1105-1110 (1991)). 

Genetic factors, other than mecA, that influence the expression of methicillin resistance in S. epidermidis 
have, until now, not been described at the molecular level. The present invention provides DNA sequences 
encoding the FemA protein of Staphylococcus epidermidis, the FemA protein itself, and vectors and microor- 

35 ganisms comprising the femA gene of S. epidermidis. 

The present invention provides DNA sequences encoding the FemA protein of Staphylococcus epidermi- 
dis, including the natural gene sequence designated femA (SEQ ID NO:1). Thus, included in the present in- 
vention is any DNA compound that comprises an isolated DNA sequence encoding SEQ ID NO:2. SEQ ID NO:2 
is as follows: 
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Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
15 10 15 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

Asn Asn Asp Asn Glu Val He Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
20 100 105 110 

Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 



15 



25 



Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

30 Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 

165 170 175 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 



35 



40 



Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 

210 215 220 

Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 



45 



50 



55 



3 



S DOC ID: <EP 0625575A2_I_> 



EP 0 625 575 A2 



Pro Leu Ala Tyr He Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 

Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys lie Asn Glu Ala Lys Asn 
290 295 300 

Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

15 He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 

325 330 335 

Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 
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25 



30 



Asn Tyr Ala lie Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 
385 390 395 400 

Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 

Leu Lys Lys 



The natural femA sequence is encompassed by the present invention as a DNA compound which com- 
35 prises the isolated DNA sequence which is SEQ ID NO:1 . SEQ ID NO:1 is as follows: 



ATG AAG ATG AAG TTT ACG AAT TTG ACA GCT AAA GAA TTT AGT GAC TTT 48 

Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 

1 5 10 15 

ACT GAT CGT ATG ACA TAT AGT CAT TTT ACA CAA ATG GAA GGT AAT TAC 9S 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 
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GAA TTA AAG GTT GCT GAA GGT ACC GAG TCA CAT TTA GTT GGA ATT AAA 144 
Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly lie Lys 
35 40 45 

AAT AAT GAT AAC GAA GTG ATT GCA GCT TGT TTA TTA ACA GCT GTT CCT 192 
Asn Asn Asp Asn Glu Val lie Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

GTA ATG AAA ATA TTT AAA TAT TTT TAT TCC AAT CGC GGT CCA GTA ATA 240 
Val Met Lys lie Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

GAT TAT AAT AAT AAA GAG CTT GTA CAT TTT TTC TTT AAT GAA TTG AGT 288 
Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

AAA TAT GTA AAA AAA TAT AAT TGT TTA TAT TTA AGA GTT GAC CCA TAC 336 
Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 110 

CTT CCA TAT CAA TAT TTA AAT CAT GAG GGA GAA ATA ACT GGA AAT GCA 384 
Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

GGT CAT GAT TGG ATT TTT GAT GAA TTA GAG AGT TTA GGA TAT AAA CAC 432 
Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

GAA GGA TTC CAC AAA GGA TTT GAT CCT GTA TTA CAA ATC CGA TAT CAT 480 
Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

TCT GTT CTA AAT TTA GCA AAC AAA AGT GCT AAT GAT GTT TTA AAA AAC 528 
Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 

ATG GAT GGT TTA AGA AAG CGT AAT ACT AAA AAA GTT AAG AAA AAT GGA 576 
Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

GTT AAA GTC CGC TTT TTA TCT GAA GAA GAG TTA CCT ATA TTT AGG TCA 624 
Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

TTT ATG GAG GAT ACC TCT GAA ACT AAA GAT TTT GCA GAT AGA GAA GAT 672 
Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

AGT TTT TAT TAC AAC AGA TTC AAA CAT TAT AAA GAC CGT GTT TTA GTA 720 
Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 

CCA CTA GCC TAT ATT AAC TTT GAT GAG TAT ATA GAG GAA CTA AAT AAT 768 
Pro Leu Ala Tyr He Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 
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GAA AGA AAT GTG CTT AAT AAA GAT TAT AAT AAA GCT TTA AAA GAC ATT 816 
Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

GAG AAA CGT CCA GAG AAT AAA AAA GCA CAT AAC AAA AAG GAA AAT TTA 864 
Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

GAA CAA CAA CTC GAT GCA AAT CAG CAA AAA ATT AAT GAA GCT AAA AAC 912 
Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

TTA AAA CAA GAA CAT GGC AAT GAA TTA CCC ATC TCT GCT GGC TTC TTT 960 
Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

ATA ATT AAT CCG TTT GAA GTA GTT TAC TAC GCT GGT GGA ACT TCA AAT 1008 
He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

CGT TAT CGC CAT TTT GCA GGG AGC TAT GCG GTT CAA TGG AAG ATG ATT 1056 
Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 

AAC TAT GCA ATT GAA CAT GGT ATT AAT CGG TAT AAT TTC TAT GGT ATT 1104 
Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

AGT GGT GAC TTT AGT GAA GAT GCT GAA GAT GCT GGC GTA GTT AAG TTT 1152 
Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

AAA AAG GGC TAT GAT GCC GAT GTT ATA GAA TAC GTT GGT GAC TTT ATT 1200 
Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 
385 390 395 400 

AAA CCT ATT AAT AAA CCA ATG TAT AAC ATT TAT AGA ACA CTT AAA AAA 1248 
Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 

CTA AAG AAA 1257 
Leu Lys Lys 



The present invention also includes the protein encoded by SEQ ID NO: 1 in purified form. Also included 
are recombinant DNA vectors, including expression vectors, that comprise DNA sequences encoding FemA. 

The restriction site and function maps presented in the accompanying drawings are approximate repre- 
sentations of the recombinant DNA vectors discussed herein. The restriction site information is not exhaustive; 
45 therefore, there may be more restriction sites of a given type on the vector than actually shown on the map. 

Figure 1 - A restriction site and function map of plasmid pPSJ1 80. 

Figure 2 - A restriction site and function map of plasmid pET-11 A. 

The instant invention provides the femA gene of Staphylococcus epidermidis and all degenerate sequenc- 
es thereof, the protein encoded by the femA gene (FemA), and vectors and microorganisms comprising genes 
so encoding the FemA protein. In the practice of the invention as exemplif ied herein, the FemA protein comprises 
the amino acid sequence, which is SEQ ID NO 2: 
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Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
15 10 15 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly lie Lys 
35 40 45 

Asn Asn Asp Asn Glu Val lie Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

Val Met Lys lie Phe Lys TYr Phe Tyr Ser Asn Arg Gly Pro Val lie 
65 70 75 80 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro lyr 
100 105 110 

Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu lie Thr Gly Asn Ala 
115 120 125 

Gly His Asp Trp lie Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
25 130 135 140 

Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin lie Arg Tyr His 
145 150 155 160 

30 



20 



35 



40 



45 



55 



7 



iDOCID: <EP 0625575A2J_> 



EP0 625 575 A2 



10 



15 



Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 

Pro Leu Ala Tyr He Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 

Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
30 325 330 335 

Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 



20 



25 



35 



Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

40 Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 

385 390 395 400 

Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 



45 



50 



Leu Lys Lys 

The present invention also provides the natural femA gene found in Staphylococcus epidermidis, embod- 
ied in SEQ ID NO: 1: 
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ATG AAG ATG AAG TTT ACG AAT TTG ACA GCT AAA GAA TTT AGT GAC TTT 48 
Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
15 10 15 

ACT GAT CGT ATG ACA TAT AGT CAT TTT ACA CAA ATG GAA GGT AAT TAC 96 
Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

GAA TTA AAG GTT GCT GAA GGT ACC GAG TCA CAT TTA GTT GGA ATT AAA 144 
Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly lie Lys 
35 40 45 

AAT AAT GAT AAC GAA GTG ATT GGA GCT TGT TTA TTA ACA GCT GTT CCT 192 
Asn Asn Asp Asn Glu Val lie Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

GTA ATG AAA ATA TTT AAA TAT TTT TAT TCC AAT CGC GGT CCA GTA ATA 240 
Val Met Lys lie Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val lie 
65 70 75 80 

GAT TAT AAT AAT AAA GAG CTT GTA CAT TTT TTC TTT AAT GAA TTG AGT 288 
Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

AAA TAT GTA AAA AAA TAT AAT TGT TTA TAT TTA AGA GTT GAC CCA TAC 336 
Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 110 

CTT CCA TAT CAA TAT TTA AAT CAT GAG GGA GAA ATA ACT GGA AAT GCA 384 
Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu lie Thr Gly Asn Ala 
115 120 125 

GGT CAT GAT TGG ATT TTT GAT GAA TTA GAG AGT TTA GGA TAT AAA CAC 432 
Gly His Asp Trp lie Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

GAA GGA TTC CAC AAA GGA TTT GAT CCT GTA TTA CAA ATC CGA TAT CAT 480 
Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin lie Arg Tyr His 
145 150 155 160 

TCT GTT CTA AAT TTA GCA AAC AAA AGT GCT AAT GAT GTT TTA AAA AAC 528 
Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 

ATG GAT GGT TTA AGA AAG CGT AAT ACT AAA AAA GTT AAG AAA AAT GGA 576 
Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 
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GTT AAA GTC CGC TTT TTA TCT GAA GAA GAG TTA CCT ATA TTT AGG TCA 624 
Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

TTT ATG GAG GAT ACC TCT GAA ACT AAA GAT TTT GCA GAT AGA GAA GAT 672 
Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

AGT TTT TAT TAC AAC AGA TTC AAA CAT TAT AAA GAC CGT GTT TTA GTA 720 
Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 



CCA CTA GCC TAT ATT AAC TTT GAT GAG TAT ATA GAG GAA CTA AAT AAT 768 
Pro Leu Ala Tyr He Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
15 245 250 255 

GAA AGA AAT GTG CTT AAT AAA GAT TAT AAT AAA GCT TTA AAA GAC ATT 816 
Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp He 
260 265 270 

20 GAG AAA CGT CCA GAG AAT AAA AAA GCA CAT AAC AAA AAG GAA AAT TTA 864 

Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

GAA CAA CAA CTC GAT GCA AAT CAG CAA AAA ATT AAT GAA GCT AAA AAC 912 
25 Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 

290 295 300 

TTA AAA CAA GAA CAT GGC AAT GAA TTA CCC ATC TCT GCT GGC TTC TTT 960 
Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 



ATA ATT AAT CCG TTT GAA GTA GTT TAC TAC GCT GGT GGA ACT TCA AAT 1008 
He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

CGT TAT CGC CAT TTT GCA GGG AGC TAT GCG GTT CAA TGG AAG ATG ATT 1056 
Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 



AAC TAT GCA ATT GAA CAT GGT ATT AAT CGG TAT AAT TTC TAT GGT ATT 1104 
Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
40 355 360 365 

AGT GGT GAC TTT AGT GAA GAT GCT GAA GAT GCT GGC GTA GTT AAG TTT 1152 
Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 



AAA AAG GGC TAT GAT GCC GAT GTT ATA GAA TAC GTT GGT GAC TTT ATT 1200 
Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 
385 390 395 400 



AAA CCT ATT AAT AAA CCA ATG TAT AAC 
Lys Pro He Asn Lys Pro Met Tyr Asn 
405 

CTA AAG AAA 1257 
Leu Lys Lys 



ATT TAT AGA ACA CTT AAA AAA 1248 
He Tyr Arg Thr Leu Lys Lys 
410 415 
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The synthesis of the FemA protein of the present invention may proceed by solid phase peptide synthesis or 
by recombinant methods. Both methods are described in U.S. Patent No. 4,617,149, the entire teaching of 
which is herein incorporated by reference. Recombinant methods are preferred if a high yield is desired. The 
principles of solid phase chemical synthesis of polypeptides are well known in the art and may be found in gen- 
5 eral texts in the area such as Dugas, H. and Penney, C, Bioorganic Chemistry (1981), Springer- Verlag, New 
York, pgs. 54-92. 

Synthesis of the FemA protein can be achieved by recombinant DNA technology. Synthetic genes, the in 
vitro or in vivo transcription and translation of which will result in the production of the FemA protein may be 
constructed by techniques well known in the art Owing to the natural degeneracy of the genetic code, the skil- 

10 led artisan will recognize that a sizable yet definite number of DNA sequences may be constructed which en- 
code the FemA protein. All such genes are provided by the present invention. A preferred gene encoding the 
FemA protein is the natural femA gene of Staphylococcus epidermidis, which is SEQ ID NO: 1 . This preferred 
femA gene is available on an ~~ 3.7 kb EcoRI restriction fragment of plasmid pPSJ180, publicly available and 
on deposit in Escherichia coii DH5a at the National Center for Agricultural Utilization Research, 1815 North 

15 University Street Peoria, Illinois 61604-39999, under accession number NRRL B-21024 (date of deposit De- 
cember 8, 1992). A restriction site and function map of pPSJ180 is provided in Figure 1 of the drawings. 

The femA gene may be created by synthetic methodology. Such methodology of synthetic-gene construc- 
tion is well known in the art. See Brown et al. (1979) Methods in Enzymoiogy t Academic Press, N.Y., 68:109- 
151. The femA DNA sequence may be generated using a conventional DNA synthesizing apparatus such as 

20 the Applied Biosystems Model 380A or 380B DNA synthesizers (commercially available from Applied Biosys- 
tems. Inc., 850 Lincoln Center Drive, Foster City, CA 94404). 

To effect the translation of the FemA protein, one inserts the engineered synthetic DNA sequence in any 
of a large number of appropriate recombinant DNA expression vectors through the use of appropriate restriction 
endonucleases and DNA ligase. The synthetic femA gene should be designed to possess restriction endonu- 

25 clease cleavage sites at either end of the transcript to facilitate isolation from and integration into these am- 
plification and expression plasmids. The particular endonucleases employed will be dictated by the restriction 
endonuclease cleavage pattern of the parent expression vector to be employed. The choice of restriction sites 
are chosen so as to properly orient the FemA coding sequence with control sequences to achieve proper in- 
frame reading and expression of the FemA molecule. The FemA coding sequence must be positioned so as 

30 to be in proper reading frame with the promoter and ribosome binding site of the expression vector, both of 
which are functional in the host cell in which the FemA protein is to be expressed. The FemA protein may be 
expressed in any number of well-known eucaryotic or procaryotic hosts using known promoters and vectors. 
Some of the potential hosts, in addition to E. coli, include the yeasts Saccharomyces cerevisiae and Pichia 
pastoris, Bacillus, and cells infected with baculovirus. 

35 To achieve efficient transcription of the synthetic gene, said gene must be operably associated with a pro- 

moter operator region. In one practice of the invention, the promoter-operator region of the synthetic gene en- 
coding SEQ ID NO: 2 is placed in the same sequential orientation with respect to the ATG start codon of the 
synthetic gene as the promoter-operator occupies with respect to the ATG-start codon of the gene from which 
it was derived. Synthetic or modified promoter operator regions have been created and are well known in the 

40 art. When employing such synthetic or modified promoter-operator regions they should be oriented with re- 
spect to the ATG-start codon of the femA gene as directed by their creators. In one practice of the invention 
as exemplified herein, where the host cell is an E. Coli host cell, said promoter-operator region is the phage 
T7 promoter-operator region. 

A variety of expression vectors useful for transforming procaryotic cells are well known in the art A pre- 

45 ferred vector for expression in an E. coli host cell is derived from E. coli plasmid pET-11 A which comprises 
the phage T7 promoter. A restriction site and function map of pET-1 1 A appears in Figure 2 of the accompanying 
drawings. Plasmid pET-11A is publicly available from Novagen, Inc. (565 Science Drive, Madison, Wl 53711) 
under catalog #69436-1. The preferred host strain is E. coli BL21(DE3), also available from Novagen under 
catalog #69387-1. 

so The techniques of transforming cells with the aforementioned vectors are well known in the art and may 

be found in such general references as Sambrook et al., Molecular Cloning: A Laboratory Manual ( 1 988), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY or Ausubel etal., Current Protocols in Molecular Biol- 
ogy (1 989), John Wiley & Sons, New York, NY and supplements. The techniques involved in the transformation 
of E. coli cells used in the preferred practice of the invention as exemplified herein are well known in the art. 

55 The precise conditions under which the transformed E. coli cells are cultured is dependent on the nature of 
the E. coli host cell line and the expression or cloning vectors employed. For example, vectors which incorporate 
thermoinducible promoter-operator regions, such as the cl857 thermoinducible lambda-phage promoter-oper- 
ator region, require a temperature shift from about 30 to about 40°C. in the culture conditions so as to induce 
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protein synthesis. 

In a preferred embodiment of the invention E. coli K12 BL21 (DE3) ceDs were employed as host cells but 
numerous other cell lines are available. The transformed host cells are then plated on appropriate media under 
the selective pressure of the antibiotic corresponding to the resistance gene present on the expression plasmid. 

5 The cultures are then incubated for a time and temperature appropriate to the host cell line employed. Spe- 
cifically, with E. coli KM BL21 (DE3) cells, the femA gene is placed under the control of a promoter transcribed 
specifically by the T7 RNA polymerase. Induction of transcription of the femA gene is accomplished by the 
addition of isopropylthiogalactoside (IPTG) to the growth medium, which induces expression of the 17 RNA 
polymerase gene under the control of the lacUVS promoter. The T7 RNA polymerase is then available to tran- 

10 scribe the femA gene. 

General techniques of protein purification are well-known to those of ordinary skill in the art. See Creight- 
on, T.E., Protein Structure: A Practical Approach (1 989), IRL Press, Oxford, England and Bollag, DM and Edel- 
stein, S.J., Protein Methods (\ 991 ), Wiley-Liss, New York, NY. Proteins which are expressed in high-level bac- 
terial expression systems characteristically aggregate in granules or inclusion bodies which contain high levels 

is of the overexpressed protein. Kreuger et al. (1990) in Protein Folding, Gierasch and King, eds., pgs 136-142, 
American Association for the Advancement of Science Publication No. 89-1 8S, Washington, D.C. and Sam- 
brook et al., Molecular Cloning: A Laboratory Manual (1 988), pp. 1 7.37-1 7.41 , Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY. The FemA protein sometimes aggregates into inclusion bodies when expressed 
under the control of phage T7 promoter. Such protein aggregates must be sotubilized to provide further puri- 

20 f icatbn and isolation of the desired protein product. A variety of techniques using strongly denaturing solutions 
such as guanidinium-HCI and/or weakly denaturing solutions such as dithiothreltol (DTT) are used to solubilize 
the proteins. 

Recombinants produced proteins may be purified by a variety of techniques well known in the art such 
as ion exchange chromatography, sire exclusion chromatography, electrophoresis, differential centrif ugation, 
25 reversed phase high performance liquid chromatography, immunoaffinity chromatography, and the like. Pro- 
tocols for use of these individual techniques or combinations thereof are well known in the art. Gradual removal 
of the denaturing agents (often by dialysis) in a refolding solution allows the denatured protein to assume its 
native conformation. The particular conditions for denaturation and refolding are determined by the particular 
protein expression system and/or protein in question. The S-sulfonates of the peptide molecules are converted 
30 to the disulfide paired, folded FemA molecules using a combination of high pH and added thiol in substantial 
accordance with the teaching of Frank, B.H. et al., (1981) in Peptides. Synthesis, Structure and Function. Pro- 
ceedings of the Seventh American Peptide Symposium (Rich, D.H. and Gross, E.» eds.) pp. 729-738, Pierce 
Chemical Co., Rockford, IL. 

The femA gene may be used in gene disruption studies in Staphylococcus epidermidis. Although it is be- 
ds lieved that the FemA protein is involved in the formation of a pentaglycine bridge in the cell wall of the bacterium, 
gene disruption will allow one to ascertain the precise effect of the loss of the femA gene. Gene disruption 
experiments in Staphylococcus aureus have revealed that a loss of femA results in an -40% reduction in cell 
wall glycine content A similar result might be anticipated for S. epidermidis. Once determined, this information 
can be used to generate an assay for agents which inhibit the FemA protein, and are therefore useful in com- 
40 bination with antibiotics to treat methicillin-resistant bacteria. 

The FemA protein of SEQ ID NO: 2 may be produced by recombinant methods. Recombinant methods 
are preferred if a high yield is desired. The present invention thus comprises a method for constructing a re- 
combinant host cell capable of expressing SEQ ID NO: 2, said method comprising transforming a host cell with 
a recombinant DNA vector that comprises an isolated DNA sequence of Claim 1. The present invention also 
45 comprises a method for expressing SEQ ID NO: 2 in a recombinant host ceil; said method comprising culturing 
said transformed host cell of Claim 5 under conditions suitable for gene expression. 

The following Examples are provided to further illustrate and exemplify, but not limit the scope of, the in- 
vention. 

so Example 1 

Source of the Staphylococcus epidermidis femA gene 

Isolation of Plasmid pPSJ180 

55 

Alyophil of E. coli K12 DH5a/pPSJ180 can be obtained from the Northern Regional Research Laboratories 
(NRRL), Peoria, Illinois 61604, under the accession number NRRL B-21024 (date of deposit December 8, 
1992). The pPSJ180 plasmid may be isolated from E. coli K12 DH5ct/pPSJ180 using techniques well-known 
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to those skilled in the art. See Sambrook et al. v Molecular Cloning: A Laboratory Manual (1988), Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY or Ausubel et al., Current Protocols in Molecular Biology 
(1989), John Wiley & Sons, New York, NY and supplements. 

5 Isolation of the Staphylococcus epidermidis femA gene via the polymerase chain reaction 

Isolated plasmid pPSJ180 is used as the template for the polymerase chain reaction at a concentration of 
1 0 ng/reaction. VentR™ DNA polymerase (2 units/jil, Catalog #254, New England Biolabs, 32 Tozer Road, Bev- 
erly, MA 01915-9965) is used with standard VentR™ DNApolymerase buffer (1X= 10 mM KCI, 20 mM Tris-HCI 
10 (pH 8.8 at 25°C) P 10 mM (NH 4 ) 2 S0 4 , 2 mM MgS0 4 , 0.1% Triton X-100). The PCR primers used are AGATA- 
TAAAGATCTAGATGGGAGTTATGAA (SEQ ID NO: 3) and ATTTCATAATTAGATGGATCCCTTCTTAAAATC 
(SEQ ID NO: 4). The reaction is carried out by 3 cycles of 94°C for 1 5 seconds, 40°C for 1 5 seconds and 72°C 
for 1 minute followed by 20 cycles of 94°C for 15 seconds, 55°C for 15 seconds and 72°C for 1 minute. 

The reaction is transferred to a Centricon 100 microconcentrator (Amicon, Inc., 72 Cherry Hill, Beverly, 
15 MA 01 91 5) and washed with 1 ml of water. The microconcentrator is then subjected to centrif ugation at 3000 
rpm in a microcentrifuge for 30 minutes. The reaction is then diluted to 200 *U (from -50 pJ> with 1X Xbal re- 
striction enzyme buffer. To this is added 50 units of Xbal and 50 units of BamHI. The DNA is then digested for 
90 minutes at 37°C. The DNA is phenol extracted and ethanol precipitated. 

20 Example 2 

Construction of an Expression Plasmid Containing the femA Gene 

The DNA created in Example 1 is then ligated to the 5.6 kb Xbal-BamHl fragment of pET-11 A (available 
25 from Novagen, Inc. (565 Science Drive, Madison, Wl 53711) under catalog #69436-1. This plasmid is then 
transformed into E. coli BL21 (DE3) (also available from Novagen, Inc. under catalog #69387-1) using tech- 
niques well known to those of ordinary skill in the art 

Example 3 

30 

Expression of the FemA Protein 

E. coli BL21 (DE3) transformed with the femA expression plasmid are grown overnight in TY broth (per 
liter 10 g tryptone, 5 g yeast extract and 5 g NaCI) and 100 ng/ml ampicillin. The cells are then diluted 1/50 
35 into TY broth + ampicillin and grown at 37°C for 60 minutes. Expression is induced by adding ispropylthioga- 
lactoside (IPTG) to 0.4 mM. Samples are taken at 0 and 6 hours and run on a % SDS-polyacrylamide gel using 
techniques described in Sambrook et al., Molecular Cloning: A Laboratory Manual (1988), pp. 18.47-18.59, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. An induced protein band is visible by staining 
with Coomassie Blue at the predicted size of 49,000 daltons. 

40 



45 



50 



55 



13 



JDOCID: <EP 0625575A2_L> 



EP0 625 575 A2 



SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(E) COUNTRY: United States of America 

(F) ZIP: 46285 

(ii) TITLE OF INVENTION: FEMA GENE OF STAPHYLOCOCCUS 
EPIDERMIDIS, FEMA PROTEIN, AND VECTORS AND 
MICROORGANISMS COMPRISING THE FEMA GENE 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: C. M. Hudson 

(B) STREET: Erl Wood Manor 

(C) CITY: Windlesham 

(D) STATE: Surrey 

(E) COUNTRY: United Kingdom 

(F) ZIP: GU20 6PH 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.0 

(D) SOFTWARE: Microsoft Word 5.1 
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10 



15 



20 



25 



30 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1257 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATG AAG ATG AAG TTT ACG AAT TTG ACA GCT AAA GAA TTT AGT GAC TTT 48 
Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
15 10 15 



35 



40 



45 



50 



55 
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10 



ACT GAT CGT ATG ACA TAT AGT CAT TTT ACA CAA ATG GAA GGT AAT TAC 96 
Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

GAA TTA AAG GTT GCT GAA GGT ACC GAG TCA CAT TTA GTT GGA ATT AAA 144 
Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly He Lys 
35 40 45 

AAT AAT GAT AAC GAA GTG ATT GCA GCT TGT TTA TTA ACA GCT GTT CCT 192 
Asn Asn Asp Asn Glu Val He Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

GTA ATG AAA ATA TTT AAA TAT TTT TAT TCC AAT CGC GGT CCA GTA ATA 240 
Val Met Lys He Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 80 

GAT TAT AAT AAT AAA GAG CTT GTA CAT TTT TTC TTT AAT GAA TTG AGT 288 
Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

20 AAA TAT GTA AAA AAA TAT AAT TGT TTA TAT TTA AGA GTT GAC CCA TAC 336 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 HO 



15 



25 



30 



35 



CTT CCA TAT CAA TAT TTA AAT CAT GAG GGA GAA ATA ACT GGA AAT GCA 384 
Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

GGT CAT GAT TGG ATT TTT GAT GAA TTA GAG AGT TTA GGA TAT AAA CAC 432 
Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

GAA GGA TTC CAC AAA GGA TTT GAT CCT GTA TTA CAA ATC CGA TAT CAT 480 
Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

TCT GTT CTA AAT TTA GCA AAC AAA AGT GCT AAT GAT GTT TTA AAA AAC 528 
Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
165 170 175 



ATG GAT GGT TTA AGA AAG CGT AAT ACT AAA AAA GTT AAG AAA AAT GGA 576 
Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
40 180 185 190 

GTT AAA GTC CGC TTT TTA TCT GAA GAA GAG TTA CCT ATA TTT AGG TCA 624 
Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro He Phe Arg Ser 
195 200 205 

45 TTT ATG GAG GAT ACC TCT GAA ACT AAA GAT TTT GCA GAT AGA GAA GAT 672 

Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

AGT TTT TAT TAC AAC AGA TTC AAA CAT TAT AAA GAC CGT GTT TTA GTA 720 
50 Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 

225 230 235 240 



55 
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CCA CTA GCC TAT ATT AAC TTT GAT GAG TAT ATA GAG GAA CTA AAT AAT 768 
Pro Leu Ala Tyr lie Asn Phe Asp Glu Tyr lie Glu Glu Leu Asn Asn 
245 250 255 

GAA AGA AAT GTG CTT AAT AAA GAT TAT AAT AAA GCT TTA AAA GACf ATT 816 
Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp lie 
260 265 270 

GAG AAA CGT CCA GAG AAT AAA AAA GCA CAT AAC AAA AAG GAA AAT TTA 864 
Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

GAA CAA CAA CTC GAT GCA AAT CAG CAA AAA ATT AAT GAA GCT AAA AAC 912 
Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys lie Asn Glu Ala Lys Asn 
290 295 300 

TTA AAA CAA GAA CAT GGC AAT GAA TTA CCC ATC TCT GCT GGC TTC TTT 960 
Leu Lys Gin Glu His Gly Asn Glu Leu Pro lie Ser Ala Gly Phe Phe 
305 310 315 320 

ATA ATT AAT CCG TTT GAA GTA GTT TAC TAC GCT GGT GGA ACT TCA AAT 1008 
lie lie Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

CGT TAT CGC CAT TTT GCA GGG AGC TAT GCG GTT CAA TGG AAG ATG ATT 1056 
Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met lie 
340 345 350 

AAC TAT GCA ATT GAA CAT GGT ATT AAT CGG TAT AAT TTC TAT GGT ATT 1104 
Asn Tyr Ala lie Glu His Gly lie Asn Arg Tyr Asn Phe Tyr Gly lie 
355 360 365 

AGT GGT GAC TTT AGT GAA GAT GCT GAA GAT GCT GGC GTA GTT AAG TTT 1152 
Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 
370 375 380 

AAA AAG GGC TAT GAT GCC GAT GTT ATA GAA TAC GTT GGT GAC TTT ATT 1200 
Lys Lys Gly Tyr Asp Ala Asp Val lie Glu Tyr Val Gly Asp Phe lie 
385 390 395 400 

AAA CCT ATT AAT AAA CCA ATG TAT AAC ATT TAT AGA ACA CTT AAA AAA 1248 
Lys Pro lie Asn Lys Pro Met Tyr Asn lie Tyr Arg Thr Leu Lys Lys 
405 410 415 

CTA AAG AAA 1257 
Leu Lys Lys 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Met Lys Phe Thr Asn Leu Thr Ala Lys Glu Phe Ser Asp Phe 
15 10 15 

Thr Asp Arg Met Thr Tyr Ser His Phe Thr Gin Met Glu Gly Asn Tyr 
20 25 30 

Glu Leu Lys Val Ala Glu Gly Thr Glu Ser His Leu Val Gly lie Lys 
35 40 45 

Asn Asn Asp Asn Glu Val He Ala Ala Cys Leu Leu Thr Ala Val Pro 
50 55 60 

Val Met Lys lie Phe Lys Tyr Phe Tyr Ser Asn Arg Gly Pro Val He 
65 70 75 SO 

Asp Tyr Asn Asn Lys Glu Leu Val His Phe Phe Phe Asn Glu Leu Ser 
85 90 95 

Lys Tyr Val Lys Lys Tyr Asn Cys Leu Tyr Leu Arg Val Asp Pro Tyr 
100 105 HO 

Leu Pro Tyr Gin Tyr Leu Asn His Glu Gly Glu He Thr Gly Asn Ala 
115 120 125 

Gly His Asp Trp He Phe Asp Glu Leu Glu Ser Leu Gly Tyr Lys His 
130 135 140 

Glu Gly Phe His Lys Gly Phe Asp Pro Val Leu Gin He Arg Tyr His 
145 150 155 160 

Ser Val Leu Asn Leu Ala Asn Lys Ser Ala Asn Asp Val Leu Lys Asn 
1S5 170 175 

Met Asp Gly Leu Arg Lys Arg Asn Thr Lys Lys Val Lys Lys Asn Gly 
180 185 190 

Val Lys Val Arg Phe Leu Ser Glu Glu Glu Leu Pro lie Phe Arg Ser 
195 200 205 



18 



EP 0 625 575 A2 



Phe Met Glu Asp Thr Ser Glu Thr Lys Asp Phe Ala Asp Arg Glu Asp 
210 215 220 

Ser Phe Tyr Tyr Asn Arg Phe Lys His Tyr Lys Asp Arg Val Leu Val 
225 230 235 240 

Pro Leu Ala Tyr lie Asn Phe Asp Glu Tyr He Glu Glu Leu Asn Asn 
245 250 255 

Glu Arg Asn Val Leu Asn Lys Asp Tyr Asn Lys Ala Leu Lys Asp lie 
260 265 270 

Glu Lys Arg Pro Glu Asn Lys Lys Ala His Asn Lys Lys Glu Asn Leu 
275 280 285 

Glu Gin Gin Leu Asp Ala Asn Gin Gin Lys He Asn Glu Ala Lys Asn 
290 295 300 

Leu Lys Gin Glu His Gly Asn Glu Leu Pro He Ser Ala Gly Phe Phe 
305 310 315 320 

He He Asn Pro Phe Glu Val Val Tyr Tyr Ala Gly Gly Thr Ser Asn 
325 330 335 

Arg Tyr Arg His Phe Ala Gly Ser Tyr Ala Val Gin Trp Lys Met He 
340 345 350 

Asn Tyr Ala He Glu His Gly He Asn Arg Tyr Asn Phe Tyr Gly He 
355 360 365 

Ser Gly Asp Phe Ser Glu Asp Ala Glu Asp Ala Gly Val Val Lys Phe 

370 375 380 

Lys Lys Gly Tyr Asp Ala Asp Val He Glu Tyr Val Gly Asp Phe He 
385 390 395 400 

Lys Pro He Asn Lys Pro Met Tyr Asn He Tyr Arg Thr Leu Lys Lys 
405 410 415 

Leu Lys Lys 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

AGATATAAAG ATCTAGATGG GAGTTATGAA 30 
(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

35 

ATTTCATAAT TAGATGGATC CCTTCTTAAA ATC 33 

40 

Claims 

1. A DNA compound that comprises an isolated DNA sequence encoding SEQ ID NO: 2. 
45 2. The DNA compound of Claim 1 which comprises the isolated DNA sequence which is SEQ ID NO: 1 . 
3. A recombinant DNA vector that comprises the isolated DNA sequence of Claim 1 . 



4. A recombinant DNA vector of Claim 3 that further comprises a promoter positioned to drive expression 
of said isolated DNA sequence. 

5. A method for constructing a recombinant host cell capable of expressing SEQ ID NO: 2, said method com- 
prising transforming a host cell with a recombinant DNA vector that comprises an isolated DNA sequence 
of Claim 1. 

55 6. A method for expressing SEQ ID NO: 2 in a recombinant host cell; said method comprising culturing said 
transformed host cell of Claim 5 under conditions suitable for gene expression. 

7. A recombinant host cell transformed with a recombinant DNA vector of Claim 3. 

20 
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8. The protein, in purified form, encoded by SEQ ID NO:2. 
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FIG. 2 
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